Method and device for modifying a zone in successive images

ABSTRACT

The process makes it possible to modify the content of a sequence of images which can be represented as a pixel matrix. For that a non-deformable target zone to be replaced in the images is identified and is characterized, both in position and in dimensions, by reference to marks located close to each other and having an unchanging geometrical relation with the target zone in each image. The position, with respect to the marks, to be assigned to a model or pattern which can be superimposed on the target zone, stored in digital form, is determined. For each particular image of a same shot the geometrical transformation to be carried out on the model so that it corresponds to variations in the appearance of the target zone in all the successive images in the same shot is determined automatically by a correlation based on the marks. Last the pixels representative of the model are substituted for the pixels representative of the target zone in the image.

The invention relates to a method and device for modifying a zone in successive images forming a sequence, representing an evolutionary scene, delivered by at least one camera.

The term "image sequence" must be interpreted in a wide sense as designating not only the image signal (analog or digital) produced in real time by a camera or a video controller but also a sequence recorded on an inscribable filing medium (digital or analogue), on a photographic, magnetic or optical medium, in a form allowing subsequent display, delivery or broadcasting at a rate equivalent to the real time rate.

More precisely, the invention relates to a method and device for replacing an image fraction corresponding to a precise region or to an object of the scene which will be named hereafter "target area", with a representation of another precise region, or another object, which is real or not, which will be named hereafter "pattern".

The fields of application of the invention are extremely various. The following may be mentioned, non limitatively

advertising, the method making it possible to add or replace advertising panels during transmissions or retransmissions of sporting events or others;

production of audio-visual works, particularly fictional works, the method then making it possible to replace the scenery during shooting of production with another scenery;

simulation, in which the method makes it possible to insert the representation of a real object in an environment formed by a synthetic image.

Numerous methods are already known which can be used for modifying a target area in an image. The target area may for example be identified by a particular colour, which is preferably unused in the rest of the image to avoid any ambiguity But this method does not take into account the image acquisition conditions, for example camera movements. The pattern cannot consequently follow the evolution of the scene satisfactorily.

In image synthesis a pattern may also be generated which is inlaid in a target area, but on condition that position, focal distance and focussing do not change.

These methods, as well as the use of superimposed shots for producing motion pictures associating real scenes and animated cartoon characters, have limitations which exclude use thereof or reveal the special effects whenever the image acquisition conditions vary significantly in time.

Last, there is known (EP-A-0 360 576) a method enabling to generate a composite sequence from two sequences of video pictures. An operator manually indicates at least all reference points which represent the corners of a polygon and a transformation circuit modifies the addresses of the pixels in the polygon for enabling combination.

That method requires a manual operation for each picture.

An object of the invention is to provide a method for modifying a same zone in successive images in a same sequence fulfilling the requirements of the technique better than those known heretofore, which can be used even when the acquisition conditions change, particularly centring, focal distance and focussing, by automatically taking this evolution into account.

For that purpose, inlaying of the image of the pattern is preceded by a phase of adaptation of that image so as to cause it to exactly match the representation of the target area in the image.

For that purpose the invention provides a method for modifying the content of a sequence of images representing an evolutionary scene and having a high degree of correlation between successive images of the same shot, the images being adapted representation as a matricial arrangement of pixels, characterized by: identifying a non-deformable target area to be replaced in the scene is identified and it is characterizing its position and dimensions by reference to reference marks related to and close to the target area in each image; determining the position, with respect to said reference marks, to give to a pattern superimposable on the target area, stored in digital form; for each particular image, automatically computing the geometrical transformation to be undergone by the pattern to make it correspond to the variations of appearance of the target area, but a correlation calculation based on the reference marks, in all successive images of the same shots; and substituting the pixels representative of the pattern in the image for the pixels representative of the target area.

The term "shot" designates an image sequence representing the same scene acquired by the same camera without sudden time or space disruption.

When the invention is carried out in delayed time, the marks may be manually indicated on the first image of each shot for easier computation. The operation may be carried out by pointing a light pen or on a table. In real time, the search may be carried in the whole image or, more frequently, only in a predetermined portion.

After the first image, the search by image processing is rendered easier since the movements of the target zone and the variations in size from one image to the next are limited by the characteristics of the camera. It is implemented by searching "characterizing points" (typically angles of contrast lines) which are distributed and whose representation is stored within the target zone. If for instance the target is an advertisement bill-board carrying alpha-numerical characters, the "characterizing points" can be graphemes which are particularly significant, such as junctions between vertical portions and transversal bars in certain letters. Recognition is then carried out on neighbourhoods whose size is for instance 6×8 pixels, using pattern recognition processes which are well known and which are sufficient whatever the scale of the image. When the image is affected with noise, it is preferable to search for 10 to 20 characterizing points for having a sufficient number of recognitions. Then, from the position of the characterizing points, at least four marks are derived which have a characterizing geometrical distribution, for instance the four edges of an advertising bill-board.

The invention can be applied even when the scene comprises fixed or mobile obstacles placed between the camera and the target area and which may be interposed between them; in this case, the parts of the image corresponding to the obstacles will be identified through their lack of correlation with the surroundings, on condition that the luminance and/or colour of the obstacle differs from that of the background; only that part of the target area which is not masked by the obstacle will be replaced.

In all cases, the method according to the invention uses space and time continuity of the scene during a same shot.

The invention also provides a device for implementing the above-defined method, comprising picture forming means delivering each image as of a matrix of pixels, the radiometric values (luminance and possibly chrominance) of each pixel being in digital form, further comprises:

means for storing reference marks for locating a target area in the image and representions of the target area on with a plurality of scales and for storing representations on the same scales, of a pattern which can be superimposed on the target area,

means for recognizing the reference marks in the current image and deriving therefrom the position and dimensions of the target area, and

means for automatically determining for each image, by spatial correlation calculation based on the reference marks, the geometrical transformation to which the pattern should be subjected to correspond to the variations of appearance of the target area, in all successive images of the same shot and for substituting, in the image, the pixels representive of the pattern for the pixels representative of the target area.

The invention will be better understood from the following description of a particular embodiment given by way of example. The description refers to the accompanying drawings in which:

FIG. 1 is a general diagram showing an observed scene, a target object, an obstacle and a shot-taking camera,

FIG. 2 is a representation of the image of the scene at successive moments,

FIGS. 3A and 3B form a flow chart of the method.

FIGS. 4A-4E constitute a flow sheet for implementing the method in particular conditions (images shot by a TV camera which may only rotate).

The method of the invention will be described in its application to the substitution of a pattern, formed by an advertising panel of fixed dimensions and location, for a target formed by another advertising panel. The two bill-boards or panels are rectangular and can be identified by the edges of each panel. "Characterizing points", typically 10 to 20 in number, are defined in the target and consist of graphemes which may easily be identified. If the target contains alpha-numeric characters, the graphemes are for instance crossings between bars, defined in a neighbourhood which is for instance of 6×8 pixels.

This application may be used in particular in the case of a sporting event broadcast in several countries with different advertising billboards.

PRELIMINARY STEPS

A preliminary step consists in selecting and storing the target, or the image area which contains it, in the form of a plurality of representations:

at different scales, throughout the whole variation range which may be covered by varying a focal distance of the camera, for example, at successive scales derived one from the other by multiplying by a root of two for rendering correlation easier;

possibly, with different perspective angles in the case of a target whose angular positions with respect to the camera may vary.

Then, with a computer, characterizing data bound to the target are extracted, for the different predetermined scales. The characterizing data can be angle points of the contrast line. They could be:

elements of the target itself, such as graphemes having a non-variable relative arrangement, angular points of the periphery, a distribution of colours,

and/or elements which have a non-variable position in the scene with respect to the target, surrounding the target and easily identifiable with the above-defined criteria.

An additional preliminary step consists either to select the "pattern" in a library of digital images or, if the pattern does not yet exist, to have it created by a designer, which may use a hardware-software device of the graphic pallet type (Hariette of Quantel -1990-, or TIPS on ATT Vista board -1988-) or still to generate it by digitizing a picture acquired with a TV camera or a scanner. Such a generation operation is conventional and out of the field of the invention.

From this image, several secondary images are generated which correspond to the same scales as those of the target.

ACQUISITION

For easying the requirements regarding computation speed, the device for implementing the invention typically will operate in delayed time, from stored images. For simpler illustration, FIG. 1 however shows the means for seizing the images and the modification device itself.

FIG. 1 shows by way of example a TV camera 10, which will be assumed to have a variable focal distance, directed towards a scene 12 containing a target 14 to be replaced, the target being in an invariable position with respect to two objects 16 and 18 and possibly being partially or wholly masked by an obstacle 20. Camera 10 is connected to a recorder, such as a magnetoscope, when transformation is carried in delayed time. When operating in real time, the camera directly feeds a computer 22 via an analogue-digital converter if it does not deliver the image in the form of a matrix of pixels each identified by a multi-bit word representing luminance and possibly chrominance values.

The method of the invention processes each shot independently of the others: for each shot, the successive images are acquired by the same camera 10 and have a high degree of space and time correlation, the evolution of the image being progressive. As shown in FIG. 2, the target area 14 may vary in dimensions and, in some images, may be masked by an obstacle 15 (images numbered 1, 2 and 3 in FIG. 2).

Then, the computer extracts data which characterize the target panel on the one hand, the pattern panel on the other hand, for the same scale.

The camera 10 may have means indicating, to computer 22, the variations of parameters related to the camera, such as the focal distance, for more readily matching characteristic elements of the target panel with corresponding elements found in the different representations of the moving element, stored or obtained by interpolation; full tracking of the target by that approach is however not satisfactory, due to unavoidable vibrations, error accumulation and need of a non standard video device. It is preferable to estimate the modification in the focal length of the camera and the elevation and bearing angles (consequently the angle under which the camera seizes the target panel) with a mathematical model.

The model may be set by using the x and y coordinates of the corners of the panel. It is particularly known that the variations of the focal length result in a variation along direction x of the type:

    X=ax+b                                                     (1)

and the angular variations (conical perspective) in:

    X=(cx+d)/(ex+f)                                            (2)

where a, . . . , f are constant values. Frequently model (2) may be modified as a second degree polynome.

The first operation carried out on the current image by computer 22 is the search for the target panel 14, by searching for the best possible match with the characterizing data stored. This search takes place by conventional methods of digital correlation, between characterizing points of the pattern, at the different scales, and the characterizing points extracted from each image. It takes into account that obstacles can mask some of the characterizing points.

The search may be simplified if it takes into account constraints:

of geometrical evolution, due to the device (since there are limitations to the speed of variation of the focal length and to the rotational speed);

of evolution of shapes imposed by the device or the scene.

When only transformations of the "geometrical similitude" type, caused by focal length variations with slight rotations, are to be taken into account, a geometrical deformation mathematical model is sufficient, of the type used for image-to-image diachronic remote detection (for instance implemented in the ATG System--Atelier de Traitement Geometrique du CNES -1986--available from Societe Europeenne de Propulsion). It is obtained by minimizing the rests as a polynomial function of degree one or two, as long as the lens of the camera do not introduce significant distortions, i.e. aberrations due to a target which is too close.

On the other hand, when large modifications of angular position occur, the geometrical deformation mathematical model introduces modifications in the shape of the target panel on the current image; then modelisation as a ratio of two polynomes, as indicated hereinabove, is used.

By low-pass time filtering on geometrical models resulting from several successive images (for instance arithmetic average on two to four images) of a same shot, the stability in time of the mathematical models, so their quality, may be estimated. It is then possible to select, among several models, the model which best represents the geometrical transformation. By using characterizing points which are in large number and distributed within the whole of a target, it is possible to identify the target even when points are behind an obstacle.

If the focal distance and orientation information are not supplied by sensors carried by the camera while the size of the target is known, the geometric transformation mathematical model also allows the focal distance and the elevation and bearing angles to be estimated.

By way of example, the characterizing data used for estimation of the focal length and angles may, in the case of a target formed by a horizontal rectangular advertising panel with a white background and a camera only having elevational and lateral deflection movements, be the four graphemes formed by the four corners of the panel. They may be marks of a particular form (such as dots, crossed lines) on a uniform background, (wall, cloth, . . . ) if the pattern is to be inlaid on such a background upon display: the distribution of the marks will then be homothetic of that of particular graphemes in the pattern.

COLORIMETRY

Once the geometric transformation mathematical pattern has thus been determined, it is generally necessary to estimate the colour transformation between the typical target panel (main image and representations) and the zone which corresponds thereto in the current image, i.e. to calculate a mathematical colorimetric transformation pattern. Such transformation is in most cases advisable, due to variations of lighting and in the response of the camera. The mathematical model is a linear transformation based on a comparison between only the neighbourhoods and points of the target panel serving for effective search for matching. The model may be elaborated by linear regression on the intensity and on the colour components R, G, B. It is in fact necessary to exclude the points which might belong to an interposed obstacle (images numbered 1, 2 and 3 in FIG. 2).

In this case again, a stability estimation time filtering is carried out for moving the influence of noise inherent in the quantification of the colours.

All these operations appear in the main path of the schematic flow sheet of FIG. 3A and, for each image, lead to generation of two geometrical transformations for passing from a stored representation of the target to the true image of the target area.

POSSIBLE EVALUATION OF THE MTF

At this stage, it is useful to estimate the modulation transfer function, i.e. the difference of sharpness between the target panel in the current image and its stored representation. For that, the frequency spectra of the neighbourhoods of the mutually corresponding points are compared. This analysis globalizes the image motion, i.e. the effect of movement, and the focussing defects.

TRANSFORMS

Then, the stored representation of the target panel is subjected to (1) the geometric transformation model, then (2) the shade transformation model and finally--under certain circumstances--(3) the modulation transfer function.

Each of these transformations is a linear transformation. So their applications may be combined.

At this step, a transformed current image of the sequence is available in which the target panel has been replaced with the representation of the target panel (previously generated image of the target model).

The pixels of the transformed current image which differ from the initial current image correspond to those points of the initial current image which represent something other than the target panel, thus to an obstacle interposed between the camera and the panel.

That set of points constitutes a mask associated with the obstacles. It will be possible to only inlay the portion of the pattern corresponding to the non-masked zone (opaque obstacle) or to locally modify the pattern before inlaying (semi-transparent obstacle). Substitution of the pattern is pixel per pixel.

In particular, a moving obstacle may create an effect of semi-transparency: an apertured obstacle (tennis net) or of small size (ball) may have the same effect.

To deal with such a case, the method of the invention uses, in a particular embodiment, a transparency map based on an a-priori evaluation or adjusted as a function of the real scene. For example, a degree of transparency equal to 1 is assigned to the non-masked points of the panel and a transparency equal to 0 (zero) is assigned to the points corresponding to the obstacles. For semi-transparent obstacles, or for rapidly moving objects, an intermediate value is assigned to the corresponding points. It is assumed that, on the scale at which the scene is observed, the non-transparent obstacles are compact, which makes it possible to spatially filter the transparency map by a conventional "closing" method of mathematical morphology.

The transparency map may be time filtered for fixed obstacles; it is also possible to time filter the rapidly moving obstacle, by assuming that their size varies slowly. Then conventional closing and opening operations of mathematical morphology may be used. A description may be found in the book by J. SERRA "image analysis and mathematical morphology" 1982, Academic Press, London.

If a confusion between colours is possible between an obstacle and points of the target panel which is masked by the obstacle, discrimination of the obstacle may be significantly improved by using a camera which operates in at least four spectral bands, rather than three spectral bands, thereby extending the capability of measuring the radiometric values of the points of the observed scene beyond the eye-visible spectrum. The method is already used for analysing plant cover on views taken from satallites: the fourth band is often in the near infra-red.

REPLACEMENT

Once these operations have been carried out, all data necessary for replacing the target panel by the pattern panel are available.

Replacement requires successively (a) the processing of the stored pattern panel, then (b) substitution of the pixels obtained for those of the image delivered by the camera and possibly stored.

(a) Processing implies carrying out on the image of the pattern panel, at a scale corresponding to that used for identifying the target panel, the same metric, colorimetric and possibly image motion transformations; then the transparency function is applied (identity in the case of transparency equal to 1, suppression in the case of zero value, attenuation in the case of semi-transparency). The result obtained is inlaid in the current image before storage or broadcasting.

Time filtering may be purely predictive and only take past images into account. It may use partially or totally a-posteriori or recursive filtering, provided that a "rotating" memory forming a delay line is used, whose capacity is a few images.

Time filtering, if used, requires detecting each change of shot which results in a discontinuity of the transformation, which is steady and continuous in the same shot.

The shot changes may be indicated by a sensor equipping the central control. It is also possible--and this is the solution which will generally be adopted--to automatically detect each change of shot by the discontinuity which it introduces into the contents of the image of the observed scene.

For example, in the case of zooming, there will be a re-estimation by searching the representation scale of the target panel, which causes the representation to match best with the target panel in the current scene. This operation may be carried out by a correlation method applied to lists of characteristic points of the type called "branch and bound". Then the above-described procedure is resumed. The estimations of the shooting elevation and lateral deflection angles are also compared, which allows to determine which camera takes the new shot, provided that it has not moved since the preceding shot which it acquired. The operations to be carried out are illustrated on FIGS. 3A and 3B.

The particular embodiment which has just been described is well-adapted to the case of a target formed by a flat surface. The invention can also be used without significant modification in the case of a target whose surface may be broken up into facettes which may be considered as flat portions, as is currently done in image synthesis.

Break down of the target, on the one hand, and of the pattern, on the other hand, into several facettes may be carried out separately and without any mutual relation.

Computer 22 may have the schematic construction illustrated in FIG. 1, some of the components possibly having the same nature as those described in document EP-A-0 360 576 already mentioned. It comprises a large capacity image memory 24 (hard disk for instance) delivering each image in turn to a signal processor 26. The processor is programmed for identifying and locating the characterizing points of the target, stored as neighbourhoods in a memory 28. The processor delivers, to a comparator 30, the elements which are necessary for selecting the appropriate representation of the target zone in a memory 28 and transfer the representation into a buffer memory 32. A pixel per pixel comparator 34 makes it possible to identify the parts of the target which are masked by an obstacle and to deliver, to a combination circuit 36, the pixels which are necessary for replacing the pixels which are not masked with respective pixels of the appropriate representation of the pattern, read from a memory 38. The resulting image may be stored in a mass memory 40 and displayed on a monitor 42.

FIGS. 4A to 4E illustrate, in detail, the steps of a process in the particular case of a scene which is observed by a single camera having a variable focal length and apt to rotate.

FIG. 4A illustrates preparation of the target model from a magnetoscope, with initialization, for each shot, by designation (for instance with a graphic tablet and a stylus) two diagonally opposite points of the target.

FIG. 4B indicates acquisition and localization of a target panel.

FIG. 4C illustrates steps which are only necessary when the colour transformation should be estimated and when obstacles or shadows should be detected.

FIG. 4D indicates the steps of laying-in the pattern model at the appropriate scale into the image.

Last, FIG. 4E indicates the steps for storing on a magnetoscope. 

We claim:
 1. Device for modifying the content of images of a sequence of images having a high degree of correlation between successive images, comprising picture forming means delivering each image as of a matrix of pixels, the radiometric values (luminance and possibly chrominance) of each pixel being in digital form, characterized in that it further comprises:means for storing reference marks for locating a target area in the image and representations of the target area at a plurality of scales and for storing representations, at the same scales, of a pattern which can be superimposed on the target area, means for recognizing the reference marks in the current image and deriving therefrom the position and dimensions of the target area, and means for automatically determining, for each image, by spatial correlation calculation based on the reference marks, a geometrical transformation to which the pattern should be subjected to correspond to the variations of appearance of the target area, in all successive images of a same shot and for substituting, in the image, the pixels representive of the pattern for the pixels representative of the target area.
 2. Method for modifying the content of a sequence of images of the same shot, representing an evolutionary scene and having a high degree of correlation between successive images of said same shot, the images being represented as a material arrangement of pixels, comprising the steps of:(a) identifying a non-deformable target zone to be replaced in the scene by a set of reference marks distributed within said target zone; (b) in each current one of said images, searching for said current image and recognizing said set of reference marks in said current image by pattern recognition process; (c) identifying the position and dimensions of said target zone in said current image by referring to said reference marks; (d) determining a position, with respect to said reference marks, to give to a pattern superimposable on the target zone, stored in digital form as pixels; (e) for each said current image of said sequence, automatically computing a geometrical transformation to be undergone by the pattern to make it match the variable appearance of the target zone by a correlation based on the reference marks, in all successive images of the same shot; and (f) substituting the pixels representative of the pattern in the image for the pixels representative of the target zone.
 3. Method according to claim 2, further comprising:storing a plurality of representations of the target zone at different scales and possibly at different angular positions; storing a plurality of representations of the pattern, each representation of the pattern being at the same scale and the same angular position as a respective one of the representations of said target zone; determining the optimal geometric transformation for passing from the target zone in a current image of said sequence of images to a selected one of said representations of said target zone; and subjecting that representation of the pattern associated to the selected representation of the target zone to said optimal geometric transformation, before substitution thereof.
 4. Method according to claim 3, further comprising determining the optimal colour transformation for passing from the target zone of the current image to one of the representations for impressing it to the associated representation of the pattern or model.
 5. Method according to claim 4, wherein said optimal colour transformation is determined by carrying out a plurality of different transforms and comparing the results thereof.
 6. Method according to claim 3, wherein the optimal geometric transformation is determined by carrying out a plurality of different transforms and adopting that synthesis of these transforms which gives best results.
 7. Method according to claim 3, further comprising the step of detecting the presence of obstacles between a shot taking apparatus delivering the sequence of images and the target zone by comparing the selected representation of said target zone after said geometric transformation and the target zone in the current image, wherein the pixels which represent the pattern are substituted only in those portions of the target zone which do not present a significant difference.
 8. Method according to claim 3, wherein the optimal geometric transformation is determined by carrying out a plurality of different transforms and comparing the results thereof.
 9. Method according to claim 2, further comprising carrying out a time filtering on the mathematical transform models.
 10. Method for modifying the content of a sequence of images of the same shot, representing an evolutionary scene and having a high degree of correlation between successive images of said same shot, by substituting a representation of variable appearance of a non-deformable target zone in each said image with a representation of a predetermined pattern, comprising the preliminary steps of:storing a pattern apt to match said target zone; defining said target zone by a set of characterizing data distributed within said target zone and having an unchanging mutual relation to said target zone; storing said set of characterizing data; and, in each of said successive images of said shot, the steps of: searching said image for and recognizing said set of characterizing data; carrying out a correlation calculation based on said characterizing data to find a position of said pattern and a geometrical transformation to impress to said pattern to obtain a representation matching the variable appearance of the target zone; geometrically transforming said pattern using results of the correlation calculation; and substituting the representation of the target zone in each said successive image with the respective geometrically transformed representation of said pattern.
 11. Method for modifying the content of a sequence of images of a same shot, consisting of a matrix arrangement of pixels, representing an evolutionary scene and having a high degree of correlation between successive images, by substituting a representation of a non-deformable target zone in each said image with a predetermined pattern, said method comprising the steps of:(a) storing, as pixels, a pattern apt to match said representation in digital form; (b) defining said representation of said target zone as a set of characterizing data distributed within said target zone and having an unchanging mutual relation in said target zone; (c) storing a full representation of said target zone and said set of characterizing points; (d) searching each said successive image of said shot for and recognizing said set of characterizing data; (e) for each image of said sequence where said set of characterizing data is recognized, carrying out a correlation calculation based on said characterizing data to find a position to give to said pattern and a geometrical transformation to impress to said pattern to make it match the variable appearance of the target zone; (f) geometrically transforming said pattern using results of the calculation; and (g) substituting the pixels representing the target zone in said image with pixels representing the respective geometrically transformed pattern.
 12. Method for modifying the content of a sequence of images of a same shot, representing an evolutionary scene and having a high degree of correlation between successive images of said same shot, by substituting a non-deformable target in each said image with a predetermined pattern, said method comprising the steps of:(a) storing a plurality of representations, at different scale, of a pattern apt to match said target; (b) storing a plurality of representation of said target at the same scales; (c) defining each said representation of said target as a set of corresponding characterizing points distributed within said target and having an unchanging mutual relation in said target; (d) storing said sets of characterizing points; and, in each successive image of said shot, (e) searching for said sets of characterizing points in each said image; (f) carrying out a correlation calculation based on said reference marks to find an optimum one among said representations of said target zone, and a geometrical transformation to impress to the corresponding optimum representation of said pattern to make it match the variable appearance of the target zone; (g) geometrically transforming said optimum representation of said pattern using results of the correlation calculation; and (h) substituting the representation of the target zone with the respective geometrically transformed pattern representation.
 13. Device for modifying the content of video images of a sequence of video images having a high degree of correlation between successive images, each said image being represented by a matrix of pixels, said device comprising:first means for storing a set of reference marks identifying a non-deformable target zone present in some at least of the video images of said sequence and for storing a plurality of representations of said target zone at a plurality of scales, said reference marks having a fixed relation with and being close to said target zone; second means for storing a plurality of representations, at the same scales as said representations of the target zone, of a pattern having a shape enabling to superimpose it on the target zone; third means for searching for and recognizing said set of reference marks in a current image of said sequence and deriving therefrom the position and dimensions of the target zone, using said representations of said target zone at different scales; fourth means for automatically computing, for each image, by spatial correlation calculation based on the reference marks, a geometrical transformation to which one respective of said representations of the pattern should be subjected to match the target zone in said current image; and fifth means for subjecting said respective representation to said geometrical transformation and for inserting, in the current image, pixels representative of the geometrically transformed pattern for the pixels representative of the target zone.
 14. Device according to claim 13, wherein said third means are arranged to search for said set of reference marks in a part of the current video image which is derived from that part of a preceding image where said set was previously found based on constraints in geometrical evolution.
 15. Device for replacing a representation of a nondeformable target with a matching representation of a predetermined pattern in successive video images of a sequence of images having a high degree of correlation between successive images, said device comprising:means for storing reference marks identifying said target, said reference marks being distributed at least within said target; means for storing at least a representation of said pattern; means for detecting said target in each of said images of said sequence and deriving the position, size and orientation of the representation of the target, based on a search for a best possible match between said stored reference marks and respective reference marks extracted from the images; means for geometrically transforming said representation of said pattern to match the respective representation of the target; and means for substituting, in each said image of said sequence where the target is found, the geometrically transformed representation of said pattern for the representation of the target.
 16. Device according to claim 15, wherein said reference marks further include elements surrounding said target and having an invariable position with respect to said target in a scene.
 17. Device according to claim 15, further comprising means for subjecting the selected stored representation to said geometrical transformation and determining that part of the geometrical transformation of said target representation which is identical to a part of said target as it appears in said current image, wherein said means for substituting the geometrically transformed pattern representation only carries out the substitution in that part of the target for which identity is found.
 18. Device for modifying the content of video images of a sequence of images, originating from a TV camera having a variable focal distance and orientation, without sudden time or space disruption, comprising:first means for storing a plurality of representations of a predetermined target present in video images of said sequence at a plurality of scales and for storing a plurality of reference marks identifying said target, said reference marks being distributed within said target; second means for storing a plurality of representations, at the same scales as said target, of a pattern having a shape fit to be superimposed on the target; means for recognizing at least some of said reference marks in a current image of said sequence, deriving therefrom the position and dimensions of edges of the target zone by comparing them to said plurality of representations of said target at different scales and selecting that stored representation of said target which best matches the target found in the image; means for computing, for each image, based on said reference marks, a mathematical model which best represents a geometrical transformation to which the stored representation of the pattern at the same scale as the matching representation of the target should be subjected to match the respective representation of the target, in said current image of said sequence; and means for subjecting the latter cited representation of the target to said geometrical transformation and for substituting, in the current image, the geometrically transformed pattern representation for the target.
 19. Device according to claim 18 for replacing a target zone formed by an advertising billboard containing alphanumeric characters, wherein said reference marks include graphemes of said characters each defined in a neighborhood of pixels.
 20. Device according to claim 19, wherein said neighborhood is of 6×8 pixels.
 21. Device according to claim 19, wherein said reference marks further include edges of said billboard.
 22. Device according to claim 18, wherein said plurality of representations stored in said first means are at scales derived from each other by multiplication by a root of two.
 23. Device according to claim 18, wherein said stored representations of said target zone and said pattern comprises a plurality of representations at different perspective angles.
 24. Apparatus for replacing a target in a sequence of successive video images of a scene, delivered by a video camera, with a stored pattern, comprising:means for storing a pattern and for storing representation of said target as a set of a plurality of characterizing data distributed in said target; means for automatically recognizing said set of characterizing data in each current image in a sequence of video images of a scene wherein the images have a high degree of correlation between successive images without human intervention and deriving an estimated location, size and perspective of said target in said current image from the relative location of the characterizing points in said current image; means for geometrically transforming said stored pattern responsive to said estimated size and perspective to make said pattern match said target in said current image; and means for inserting said geometrically transformed pattern into said current image in substitution of said target.
 25. Apparatus according to claim 24, further comprising means on said video camera indicating variations of the focal length of said camera for assistance in recognition of said set of characterizing data.
 26. Apparatus according to claim 24, wherein said means for automatically recognizing said set of characterizing data are arranged to search for said set in a part of each successive video image which is derived from that part of a preceding one of said video images where said set was previously found based on constraints in evolution.
 27. Apparatus according to claim 24, wherein said means for storing a representation of said target are arranged to further store a full representation of said target and wherein said apparatus further comprises means for geometrically transforming said representation of said target responsive to said estimated size and perspective, for determining which part of the geometrically transformed representation of the target is identical to a part of the target as it appears in said current image, and wherein said means for substituting the geometrically transformed pattern only carry out the substitution in that part of the target for which identity is found between the current image and the transformed representation of said target.
 28. Apparatus according to claim 24, wherein said means for storing a representation of said target are arranged to further store a full representation of said target and wherein said apparatus further comprises means for determining an optimal color transformation for passing from the target as seen in said current image to said full representation of said target and for impressing said optimal color transformation to said geometrically transformed pattern prior to insertion of said geometrically transformed pattern into said current image.
 29. Method for replacing a target in a sequence of successive video images of a scene, delivered by a video camera, with a stored pattern, comprising the steps of:(a) storing a representation of said target as a plurality of characterizing points distributed in said image; (b) automatically recognizing, by pattern recognition processes, at least a subset of said set of characterizing points in each current image in the sequence and deriving an estimated location, size and perspective of said target in said current image from the relative location of the characterizing points in said current image; (c) geometrically transforming said stored pattern responsive to said estimated size and perspective to make said transformed pattern match said target in said current image; and (d) inserting said geometrically transformed pattern into said current image in substitution of said target.
 30. Method according to claim 29, further comprising:during step (a), storing a full representation of said target; geometrically transforming said representation of said target responsive to said estimated size and perspective; determining, by pixel-by-pixel comparison, which part of the geometrically transformed representation of the target is identical to a part of the target as it appears in said current image; wherein, during step (d), only a part of the target for which identify is found between the current image and the
 31. Method according to claim 30, wherein said comparison is carried out for further determining a degree of transparency of semi-transparent obstacles and the geometrically transformed pattern is inlaid with a degree of attenuation related to said degree of transparency.
 32. Method according to claim 31, wherein time filtering is applied to said semi-transparent obstacles.
 33. Method according to claim 29, further comprising deriving a signal indicating variations of the focal length of said camera from said video camera and using it for assistance in recognition of said set of characterizing points.
 34. Method according to claim 29, wherein, said set of characterizing points is searched only in a part of each successive video image which is derived from a part of a preceding one of said video images, where said set was previously found, selected on the basis of constraints in evolution between successive images.
 35. Method according to claim 29, further comprising:during step (a), storing a full representation of said target; determining an optimal color transformation for passing from the target as seen in said current image to said full representation of said target; and impressing said optimal color transformation to said geometrically transformed pattern prior step (d).
 36. Method according to claim 29, wherein step (a) further comprises storing characterizing data distributed out of said target in addition to characterizing data within said target.
 37. Method according to claim 29, further comprising a preliminary step of manually indicating said characterizing points to be stored as neighborhoods of pixels.
 38. Method according to claim 29, wherein said characterizing data include significant graphemes of alphanumerical characters within said target.
 39. Method for replacing a target in a sequence of successive video images of a scene, delivered by a video camera, with a stored pattern, comprising the steps of:(a) storing a representation of said target as a plurality of distributed characterizing points image and a full representation of said target; (b) automatically recognizing at least a subset of said set of characterizing points in each current image in the sequence and deriving an estimated location, size and perspective of said target in said current image from the relative locations of the characterizing points in said current image; (c) geometrically transforming said representation of said target responsive to said estimated size and perspective; (d) determining, by pixel-by-pixel comparison, which part of the geometrically transformed representation of the target is identical to a part of the target as it appears in said current image; (e) geometrically transforming said stored pattern responsive to said estimated size and perspective to make said transformed pattern match said target in said current and said geometrically transformed pattern into said current image in substitution of said target; (f) substituting only a part of the target, for which, during step (d), identity was found between the current image and the transformed representation of the target, with a matching part of said geometrically transformed pattern.
 40. Method according to claim 39, further comprising the steps of:determining an optimal color transformation for passing from the target as seen in said current image to said full representation of said target; and impressing said optimal color transformation to said geometrically transformed pattern prior to step (f).
 41. Method according to claim 39, wherein step (a) comprises storing characterizing points distributed out of said target and within said target.
 42. Method according to claim 39, wherein said characterizing points are stored as neighborhoods of pixels.
 43. Method according to claim 39, wherein said characterizing points include significant graphemes of alphanumerical characters within said target. 